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ABSTRACT 



A simulation study of library-based information retrieval systems 
is described. Basic models for each of several important aspects are 
presented: user behavior, emphasizing response to quality and delays 

in services; the scheduling of services and the organization of the 
machine -readable files; the distribution of conventional library materials 
Many of the variables in the model (e.g. , user expectations of service 
time, delays involved in providing service, etc.) are random. An 
adaptive element is present in the form of admitting changes in user 
expectations in response to system changes. The need for frequent model 
change and the merging of component models into a comprehensive model 
for the entire system is established as the basis for use of a user- 
oriented simulation language. Statistical facility beyond that supplied 
in the language used (GPSS/360) is required leading to an inquiry into 
the ingredients of a supplementary program package to achieve the goals 
of this and related investigations. 
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AN OVERVIEW 



This discussion is intended to provide an overview of some digital 
sifnula txon efiorts in a very large area' of study roughly characterized by 
the terms: information retrieval networks or library-based information 

retrieval systems. 

Previous Studies 

A word about the context in which the present study is found is use- 
ful. In the early days of modelling of information storage and retrieval 
systems, researchers dealt primarily with cost and timing factors for the 
automated subsystem. Studies of this type include those of Bourne and 
Ford (1964} and Blunt et. al. (1966). Concern has spread to other aspects 
of the study: the role of the user in the information process (Reilly and 

Hayes, 1967); the environment in which the automated system is found in- 
cluding the interface with non-automated systems (the library for us) and 
the role of policy and funding decisions (Baker and Nance, 1968). Most of 
these efforts have at least toyed with the idea of a "general" model ap- 
plicable to a reasonably wide-range of systems of a certain type. We shall 
see, however, that difficulties have arisen in this attempt, when we dis- 
cuss one of these "general” models later. The task of achieving generality 
becomes more acute when the scope of the model becomes larger. Generality 
in the sense of macro components seems to be the only hope here; we shall 
meet with this concept again later. 
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Aspects for Modelling 

The additions that we must make in orckr to account for features 
beyond the automated systems extend the scope of th entire project to 
include the following: 

users: people, seeking information; 

a computer-processing center (or centers) ; 

storehouses of conventional library materials (books, etc.) 

When it comes to users we may think of user-computer dialog — the on- 
line systems so much discussed today. Alternatively, we may think of the 
user involved in the business of information gathering: creating demands 

for services and experiencing alternately frustration and satisfaction 
when services are not or are provided. 

When it comes to computers and information retrieval we may think of 
file management, e.g., strategies for file organization, the maintenance 
and updating of the file, and the kinds of retrieval services to be pro- 
vided. Also we may think of problems of scheduling of services: the query 

input rate and the flow of queries through the system. 

When it comes to storehouses of conventional materials we may think 
of now-prosaic self-inquisition, the favorite among information specialists: 
"Where are we going to put it all?" We may think of the libraries f need to 
adopt microreproduction for space-saving and widespread storage of materials 
of book length, journals, and report literature. Alternatively, we may 
imagine a large central library with comprehensive holdings in all forms 
and connected by facsimile and microwave transmission to local libraries. 
(Knowledge of problems in this area and the proposed solutions may be ex- 
pected to be fairly widespread since specialists outside of the library and 
information science community have put forth their views (e.g., Kemeny, 

1962; Brown, et. al., 1967).) 
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We have developed models in each of the three areas just mentioned.. 

We have not yet incorporated every aspect in each of these. We have, how- 
ever, included most of the features that Baker and Nance (loc. cit.) re- 
ported as lacking in previous studies. The models have already been used 
in discussions and for suggesting experiments in connection with a state 
library network in California (Reilly, 1968a) and with a medical library- 
network (to be published) . A full-scale experimental and simulation project 
for the latter area is envisioned for the future. 

Our objective from the beginning has been to be able to use the system 
of component models in an integrated fashion. A couple of difficulties 
must be overcome in so doing. The first and most obvious one is that the 
extensiveness of the effort is almost prohibitive; the experimental work by 
Goodman et. al. (1966) and Goodman and Jones (1968) on information-seeking 
habits of scientists and technicians in industry and the work by Blunt et. 
al. (1966) on the simulation of the (subsystem) information retrieval proc- 
essing center attest to the truth of this statement. Even with the (strict- 
ly) subsystem models we must compromise on many matters: we must classify 

individual material items; we must group users; we must restrict our atten- 
tion to only part of the "information process” (Reilly, 1968b). The second 
difficulty is that we must contend with disparate event- time scales for 
each of the portions of the systems, as illustrated in Figure 1. 



EVENTS 



TIME RANGE 



computer processing 

user decisions 

ordering and delivery of 
materials 



nanoseconds to milliseconds 
seconds to minutes 
minutes (several) to days 



Figure 1: Illustration of time scales relevant to activities in 

an information retrieval network. 



Part of the solution to the first of the difficulties has already been 
mentioned. The rest of the solution to that difficulty and to the second 
difficulty. lies in the development of a hierarchy of models similar in con- 
cept to that mentioned by Jones (1968) . The results from running a model 
at one (smaller- time scale) level are written into disk data sets and made 
available to the next higher level model. The necessity of working with 
the small-time events first is apparent. The data developed in the lower- 
level models are in the form of histograms and system-monitoring statistics 
(numbers) for decision processes that transactions moving through the 
higher-level model blocks may utilize. The models are designed to be run 
back-to-back with no programmer intervention, if so desired. 



COMPUTER PROCESSING CENTER MODEL 



A discussion of the full effort can begin with the model of Blunt (loc. 
cit.), the main focus of which is scheduling within the computer processing 
center (see Figure 2). Some terms that can be applied to this model, which 
indicate its scope, are the following: 

multiple entry points for queries (possibly remote) ; 
multiple entry modes (telephone, courier, etc.); 
ditto for the exiting of responses 

activities such as card punching, card- to- tape conversions, 
card/tape transporting; 

macro-level processing times (e.g., high and low levels of 
file search, editing, output, eto) ; 

error detection and correction (rewriting) ; 

availability times for input and system facilities. 

Implementation 

We have programmed a model of this type. The program, in GPSS con- 
sists of two separate jobs, an initial job which writes a magnetic tape 
(a so-called JOBTAPE) and a follow-up job that processes the tape. 

Basically, three types of data are assigned to parameters of trans- 
actions (the queries) in the first program: identification information; 

delay times at various service units; numbers for logical and probabilistic 
routing of transactions through the service units. (Other parameters, 
called auxiliary parameters, are used in the tape writing programs; but, 
to a large extent, their use constitutes a program facilitation detail that 

we need not discuss here.) The information that goes into the parameters 
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Figure 2: 



Gross level chart of information retrieval center 
processing events. Model adapted from C. Blunt et. al. 5 
1966. 
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is derived Jsy FUNCTIONS and VARIABLES from the query code (or generaliza- 
tions thereof, our auxiliary parameters), illustrated in Figure 3. Func- 
tion numbers for delays, probabilities for errors and routing, etc. are 
assigned from parsing the query code (e.g., a ”1” in position number 2 of 
the code indicates the need to use certain "low" (shorter) time functions 
for delays in searching, sorting, editing, etc.). 

The program itself has two "control" loops (one for timing and the 
other to restrict query production to certain times of the week) . The 
only limit on the number of days to be simulated in any one run is the 
user’s budget; the number of days simulated is controlled entirely by a 
single control card. The other "control" loop turns logic switches on 
and off according to the schedule desired; the switches control whether 
or not queries that are generated are allowed to enter the system. The 
"guts" or main portion of this program contains the parameter assignment 
statements. These statements are situated after the GENERATE blocks (we 
have one generate block for each query type) for assignments that are 
unique to each query type. Another section of code that is branched to 
by all types of queries provides a list of assignment statements shared 
by all the different kinds of queries. 

The second program reads the JOBTAPE provided by the first. The 
JOB TAPE contains the relative time at which each transaction is to enter 
the system. After entering, each transaction follows its allowed path 
through the system; this path may be tortuous since branching at certain 
points is probabilistic. Figure 4 illustrates the kinds of delays, error 
processing, etc. experienced by query- type-1101 transactions. Most of 
the rectangular blocks consist of a series of statistics gathering blocks, 
facility or storage entities, and one or more TRANSFER blocks. (Basi- 
cally, three separate strings of code could model almost all of these 
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QUERY 

CODE 




T 

MEANING 


SITE 


METHOD OF 
INQUIRY 


SEARCH 


OUTPUT COMMUNICATIONS 


1101 


A 


Telephone 


Low 


Courier/Data Link 


1102 


A 


Courier 


Low 


C our ier 


1201 


A 


Telephone 


High 


Courier/Data Link 


1202 


A 


Courier 


High 


Courier 


2201 


B 


Telephone 


High 


Courier/Data Link 


2202 


B 


Courier 


High 


Courier 


2303 


B 


Teletype 


Lo-Hi 


Data Link 


3101 


C 


Telephone 


Low 


Courier 


3102 


C 


Courier 


Low 


Courier 


3103 


C 


Teletype 


Lo-Hi 


Data Link 



Figure 3: Query types, from paper by Blunt, et. al. (1966) 
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Transfer to Tape or 
3nter Cards to Computer 



Initialization of CPU 
Processing. 



Determination of 
Search Detail. 



Join CPU Queue. 



File Search 



On-line Correct (EVA- 
& EVB65) . 

Sort/Edit (EVA63) 

S ummary (EVA6 4) 



Output Messages. 
Transport for Correc- 
tion (EVA82) . 



Remote Print (EVA52) . 
Off-line Print (EVA 51) . 
Off-line Print (EVA42) . 
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Transport Output for 
Delivery. 

Courier Delivers 
Results . 

User Receives Results. 

Figure 4: Pathway of Query Type 1101 through the system. 
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blocks; this Implies that three basic macroes could be used to write the 
entire system description.) The triangularly-shaped blocks are the trans- 
fer blocks; it can be seen that they provide for alternate pathways and 
for the possibility of rerouting (for error correction) . Once mere "con- 
trol” loops (for time and for facility, etc., availability) exist. Another 
loop exists for unscheduled breakdowns, represented generally as a facility*? 
preempt states. 

The output from this model develops in snaps taken at various intervals. 

So far we have utilized the normal GPSS output: 

FACILITIES 

average utilization 
number of entries 
average time of transit 
seizing transaction 

STORAGES 

maximum contents 
average contents 
capacity 

average utilization 
number of entries 
average time of transit 
current contents 



QUEUES 

maximum contents 
average contents 
total entries 
zero entries 
percent zeros 
average time of transit 

»» »» t» IT 

table number 
current contents 

It is simple to detect where bottlenecks occur in the system with this 
kind of output. We have run a model similar to Blunt r s example model and 
have achieved somewhat similar results. Some effort is being put into the 
task of making the two models achieve almost identical results. Simple 
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alternative models have been tried out (e.g., additional availability time 
for the CPU). Addition of cost aspects of the model is being investigated 
by a student. These changes and additions could well amount to be 
substantial. They are not, however, the full range of alterations (see 
below) . Reports describing this effort and the results from simulating 
the alternative cases will be available in the not too distant future. 
Extensions 

A simple reprogramming of a model like the Blunt model in GPSS 
strengthens it some: statistic- gathering blocks can be added most easily; 

restrictions on generality of certain variables and functions in the older 
model are readily cast aside, etc. More substantial additions consist of 
admission of a larger number of sites, and the facilities, etc. for 
handling them, preparing for further types of input/output (e.g., on-line 
query) facilities, etc. (Some of these changes can be "grafted" to the 
second (system) model without changing the first JOBTAPE- writing program.) 

A substantial change of critical importance to us is merging it with 

another model for a computer-processing center that has also been implemented 

in a simple form (see Figure 5 for a gross-level flow chart of this second 

model) . This second model turns toward a more automatic scheduling system 

as opposed to the external rigidly- imposed scheduling system in the Blunt 

model. The new (merged) model will have a number of new features: 

update input as well as query input. 

a priority system for update and queries depending 
on factors such as importance and size (updates) ; 
professional level, site of input, etc. (queries). 

automatic reallocation of priorities by the system. 

output of messages into the ordering- and- delivery system. 

The merging of these two models represents a major test of our assumption 

that programming- change ease in GPSS offsets any difficulties we may encounter 

in using GPSS, e.g., in programming more advanced statistics. 
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Figure 5: Gross level chart 

subsystem. Files 



of model for file organization and update 
are activity organized. 



order-and-delivery system model 



A model that we discuss very quickly is the ordering-and-delivery 
system (see Figure 6) . This system might also be called the conventional- 
materials distribution system since it is concerned to a very large extent 
with just that. Near completion is a comprehensive four-library version 
representing a bimodal network (e.g., the University of California). Some 
of its interesting features are: 

notion of partial satisfaction of a need. 

the utilization of the user- model (to be discussed next) 
in a mode appropriate to partial satisfaction and as 
an integral part of the model. 

input for computer processing center model of the type 
discussed earlier. 

the representation of library holdings in a very detailed 
manner using matrix savevalues for probabilities that 
a material is presently available. 

a four-level description of request types (fact, one 
material item, a survey, and exhaustive search) 
analogous to Goodman, et. al. (loc. cit.) . 

A nine-library version of this model is also being considered. It would 

allow a less schematic representation of the University of California 

library. 




Figure 6: Gross level chart of model for taking account of distribution 

of materials in the network and for fractional completion of 
request requirements. Each library in the network is repre- 
sented by an array of probabilities which give the probability 
of that library T s meeting a request in a given subject area. 

An array of this type exists for four depths of service. The 
subject area of the request is determined and by invocation 
of a search strategy the path for satisfying the request is 
traced from library to library with timing of delivery of 
materials and search affecting actual service time. 
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USER-BEHAVIOR MODEL • • 

In this section we discuss a (limited) version of the user-behavior 
model. One limitation that we impose (for discussion purposes) is that we 
are concerned with only one type of service, e.g., books. 

Formulation of the Model 

We first postulate the existence of a need time (NT) for our users, and 
a (actual) service time (AST) for our (one) material; these are both dis- 
tributed quantities. We may also postulate that if, for any request, 

AST < NT 

at least one impediment to successful service has been removed. A second 
ingredient to success is convenience; if we get what we want when we want 
it only at the price of great effort we are not going to be happy. For 
simplicity, let the condition that AST ^ NT be the only condition that de- 
termines user satisfaction. 

Let us assume a mechanism of the following type for our user. Let him 
be partly "irrational” : if he gets poor service on an occasion he is less 
likely to consider the library as a good place to go the next time he needs 
information. Let this emotion be dubbed the consideration probability (a 
threshold in the model) , and be denoted by CON. 

Let our user also be partly T !rationar. Let him retain an estimate of 
service times on the basis of his experiences (or on someone else r s author- 
ity). Let this record consist of a parameter, the estimated service time, 
EST, interpreted as a threshold ov as the mean of distribution whose dis- 
persion he knows. 

'‘ii " ^ 
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It should be clear from the above statements that we expect both CON 
and EST to change with experience (i.e., with each interaction with the 
library). Assume one need per one time unit and as a starting' point, the 
following change rules: 




ESX t + x 




(1 - 9) CON t + 9 
(1 - 9) CON t 



if E 

o 

if E s 
if E p 




A EST. + B 
r 

VC EST t + D 



if E 

o 

AST t if Eg 

ASX. if E_, 
t F 



where the event E is a no-request event, E c is a successful request, and 
Ep is a failed request; 9 is a constant (0 ^ 9 ^ 1) ; A, B, C, D are con- 
stant, not restricted in value. The CON change rule is exactly the same 
as that in the Linear Model of Mathematical Learning Theory (see Hilgard 
and Bower, 1966) . A high value of 9 produces a fairly stable set of values 
for CON; near-unity values of A and B (e.g., .9) coupled with near-zero 
values of B and D (e.g., .1) produce a fairly stable set of EST values. 

Taken together the statements of the previous paragraphs provide us 
with the user-behavior model presented in Figure 7. One other element of 
the model (not depicted in Figure 9) is the matter of costs and returns for 
various information policies that the user can invoke. We shall discuss 
this more below. 

Elementary Model Results 

Let us discuss some of the elementary results obtained from running a 
model of this type. Specific values for parameters must be assumed. (Rela- 
tive values are of most importance in a comparative analysis.) For 0, 
we use the value .1; different values of A, B, C, D were used as indicated 
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Figure 7: Gross level chart of user-behavior sybsystem model 
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below. Initial values for CON and EST must also be assumed; a range of 
values of these were also utilized. 

Figure 8 shows how the final value of the mean of CON is independent 
of the initial value — a fact that can be shuv/n mathematically. A similar 
phenomenon obtains for EST. 

Figure 9 illustrates the distribution of AST along with two EST distri- 
butions for cases when A = C = .9 and B = D = .1 and v;hen A=-B= : C=D=5, 
respectively. It can be seen that the EST distribution is con- 
fined within that for AST. This same result can be exhibited in the inter- 
val-by-interval plot of the values of EST and AST. This is due (quite 
clearly) to the fact that EST t + , is, with the particular rules we have 
chosen here, a weighted average of the new AST value with the old EST and 
thus must always result in a value between these two quantities. 

Evaluation Procedures 

One of the purposes of estimating service times is to avoid making- 
requests when there is a low probability of success. The effects of various 
policies can be seen in terms of a plot (or table) of values for needs, 
considerations, requests, and successes. Such a curve is presented in 
Figure 10. Figure 11 provides a rundown on these items in several cases: 

1) CON mechanism only; 

2) EST mechanism only; user estimates EST f s mean; a dis- 

tribution about the mean is provided and used in a 
purely guessing fashion. 

3) CON and EST, the latter used as in- the previous case; 

4) CON and EST, the latter sectioned with user knowing 

when to expect long and short service times. 

Evaluation in terms or costs and returns is useful. The costs for 
fulfilling requests should be relatively easy to represent in the model 
since they are related to system utilization. Returns, seemingly, are 
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Figure 8 : Probability of consideration to use the library network (averaged over 

types of needs as well as over the users) as a function of time, with different 
initial values. 
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Figure 10: Number of needs, considerations to use the 

and responses meeting the needs. 
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GUESS 
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CONS 
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REQS 
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SUCCESS 
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310 


454 



Figure 11: Needs, considerations, requests, and successes for 

four cases. (See text for fuller description of 
the cases.) 
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not easy to express. The exact treatment of them is not, however, the 
issue here. Our concern is this: "Given a cost/return paradigm, how valu- 

able, is it for the user to be able to accurately predict the AST?” For 
illustrative purposes only, we assumed a fixed cost (e.g., a mean value) 
for each request and a fixed return for each successful request. The 
values that we used implied a fairly high overall returns rate (20-40%) . 

A high rate might be defended by analogy to that associated with obtaining 
or extending education. The results of our analysis are illustrated graph- 
ically in Figure 14 for the following four cases: 

1) EST used in a guessing mode, as described above; 

2) EST as a threshold (no distribution) ; 

3) a fixed threshold (with various assumed values) ; 

4) EST with expectations of long and short service times. 

» I 

GPSS AND STATISTICS 

We have already seen a bird r s-eye view of the statistics provided 
automatically by the GPSS program. It just happens from the nature of many 
models (e.g., our Blunt-like model) a great deal of mileage can be had 
without recourse to further statistical effort. In other eases correlations 
between variables, etc. are essential to the expression of model results. 

The quickest path to more statistical power in GPSS is to link it to 
the FORTRAN libraries of statistical routing (e.g., the Scientific Sub- 
routine Package (SSP) of IBM, the UCLA Biomedical statistical programs 
(BMD) , etc.). In order to make the link successfully it is necessary to 
do a certain amount of programming in IBM 360 Assembly Language in context 
of the GPSS HELP block. Let us briefly discuss how this might be done for 
a particular example. The program -chat we wish to link to is a FORTRAN 
chi-square program. The data to be analysed is contained in the parameters 
of the "trigger" transaction (the one that goes through the HELP block) . 



- 24 - 



We begin with the SPSS program in which the following statement is en- 
countered: 

HELP EFCHI 

This brings into core a load module consisting of an Assembly Language 
Interface and the (compiled) FORTRAN chi-square subroutine and any routines 
it may call. Control transfers to this program. 

The first stage in the Assembly interface is spent in going after the 
parameters of the "trigger” transaction. (GPSS provides a set of control 
words, located through Register 10, which allow us to get at these data.) 
Before we can do anything with any parameters we must first take into ac- 
count whether they are half- or full-words. Having done that we can move 

* 

out the first two numbers giving the size of the data array (N, the number 
of rows and M, the number of columns). We also at this stage move X*^ 1 or 
X T 58 T into a location symbolically addressed by the name OPCODE. OPCODE • 
sits in the middle of a line of code at the very point a L or LH is required 
(inside a loop) to move individually each parameter out of the GPSS storage 
location into a register. With the parameter value in this register a 
branch is then made to a GPSS-provided routine (FLOAT) for converting it 
from fixed point to floating point. Upon return from this routine the con- 
verted parameter value is stored in an area previously set aside for it in 
another (GPSS) routine, GETCOR. The address of the area where the con- 
verted parameter ends up was set, upon return from GETCOR in AAD. AAD is 
one member of the "parameter list" (see Figure 13) required by the FORTRAN 
chi-square program. At the end of the data conversion stage control passes 
to chi-square. Finally, chi-square T s output data must be reconverted apd 
stored in the parameters of the "trigger" transaction (or in savevalues) 
through a series of statements that reverse those just outlined. 

The above example describes the situation in a particular case. An 
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PARAM 






AAD 



* 

CSAD 

NDFAD 



* 

TRAD 

TCAD 



EQU * 

INPUT 

DS F address of array of chi square input data 

DC A(N) address of N the number of rows 

DC A(M) address of M, the number of columns 
OUTPUT 

DC A(CS) address for CS, chi-square 

DC A(NDF) address of NDF, the number of degress of freedom 

DC A(IERR) address of error code 

WORK 

DS F set in program 

DS F set in program 



Figure 13: Parameter list for FORTRAN chi-square routine. 



interesting problem arises if we wish to pass other GPSS data types (e.g., 
parameters, savevalues, matrix savevalues, block counts, entity attributes) 
tc the assembly program.' (It may be feasible to short circuit this 
need for generality by passing all data to parameters and savevalues in the 
GPSS program before passing into the Assembler phase; however, this may not 
always be possible.) A solution to this problem seems possible through 
introduction of general HELP block, STAT , the fields of which contain pro- 
per control information for linking to any data type. It is also necessary 
to have information in either these fields or elsewhere to denote the sta- 
tistical routine that is required. The "elsewhere" most likely is the 
parameters of the triggering transaction. In order to utilize such a system 
it would be necessary to develop a user's manual (similar, in some ways, to 
those developed for BMD and SSP) describing how to set up the fields of this 
block and what numbers to assign to the parameters of the incoming "trigger" 
transaction. An example, in which the origination of the data, the desired 
routine, and the size of the input arrays are set up in the fields of STAT 
is the following: 

HELP STAT, 1101, 1001, 1901, 3, 3 

where: A = STAT = name of the HELP routine 

B = 1101 = chi-square test (non-parametric tests = 1100) 

C = 1001 = input data is in savevalues (1000) ; first data 
item is in XI 

D = 1901 = output data is to be returned to savevalues 
(1000) starting at X901. 

E = 3 = number of rows in input table 

F = 3 = number of columns in input table. 

The Assembly code would have to be supplied with information to decode 
these (coded) values and to set switches as to where to branch for calling 
the proper subroutine. 
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On-line system user 
M = 238 

S t = 66 



Off-line system user 



M t = 275 
S t = 67 





Figure 14- : Example of prediction tree* depicting service time 

means and standard deviations as a function of 
information known about the user. 



* 




In passing we would like to note that linking to PL/1 seems desirable. 
A program for manipulation of prediction trees is possible utilizing the 
list-processing capabilities of PL/1. An example of the kind of output to 
be realized in such a case is that shown in Figure 1M-, wherein the response 
time mean and standard deviation are predicted according to amount of in- 
formation we have on the user. Many of the basic PL/1 capabilities for 
manipulating such trees have been investigated as has the linkage from 
GPSS. We should soon have a working version of such a program. 
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